Llama 3.3 70B Instruct Quantized.w4a16
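A quantized version of Llama 3.3 70B Instruct, built on the Meta-Llama-3.1 architecture. In the w4a16 scheme, weights are stored in 4 bits while activations stay in 16 bits, which lowers memory and compute requirements while maintaining high performance. The model supports multiple languages and is suitable for business and research use.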
Large Language Model
Transformers
Supports Multiple Languages
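
To give a rough sense of the reduced resource requirements mentioned in the description, the sketch below estimates the storage needed for the weights alone at 16 bits versus 4 bits. It assumes 70 billion parameters (read from "70B" in the model name) and that every weight is quantized; it ignores quantization scales, any layers kept at higher precision, activations, and the KV cache, so real figures will be somewhat higher. The function name `weight_memory_gb` is hypothetical, chosen for illustration.

```python
def weight_memory_gb(num_params: float, bits_per_weight: int) -> float:
    """Approximate storage for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return num_params * bits_per_weight / 8 / 1e9


# Assumption: parameter count taken from "70B" in the model name.
NUM_PARAMS = 70e9

fp16_gb = weight_memory_gb(NUM_PARAMS, 16)  # unquantized 16-bit weights
w4_gb = weight_memory_gb(NUM_PARAMS, 4)     # 4-bit weights (the "w4" in w4a16)

print(f"16-bit weights: ~{fp16_gb:.0f} GB")
print(f"4-bit weights:  ~{w4_gb:.0f} GB")
print(f"Reduction:      ~{fp16_gb / w4_gb:.0f}x")
```

Under these assumptions the weights shrink by about a factor of four, which is the main reason a w4a16 checkpoint can fit on fewer or smaller accelerators than the 16-bit original.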